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TWO  ITEMS  CONCERNING  DIRECTIONAL  DATA 


by 

M  A.  Stephens 

In  this  paper  we  consider  two  topics  connected  with  directional 
data,  i - e, ,  observations  which  may  be  recorded  by  unit  vectors  OP, 
from  the  center  0  of  a  circle  or  sphere  of  radius  1,  to  points 
on  the  circumference  or  surface,  Alternatively,  the  point  P  on  a 
circle  may  record  an  observation,  such  as  the  occurrence  nf  an  event 
during  a  period  whose  length  is  represented  by  the  total  ci rcumference. 

When  the  observations  are  clustered  around  a  central  direction, 
the  von  Mises  distribution,  on  the  circle,  or  the  Fisher  distribution, 
on  the  sphere,  are  used  to  describe  the  data  These  are  unimodaL 
probability  distributions,  with  density  on  the  surface  proportional 
to  exp(<cosa),  where  k  is  a  concentration  parameter  and  a  the 
angle  between  OP  and  the  central  (modal)  vector  &  It  will  be 
assumed  that  the  reader  is  familiar  with  these  dist ribut ions>  the 
references  given  in  the  text  may  be  used  as  sources  for  earlier  references 

The  two  topics  to  be  discussed  in  the  next  two  sections  are;  In 
section  2>  A  discussion  of  the  standard  tests  and  confidence  Interval 
procedures  for  the  modal  vector  A  of  a  von  Mises  or  a  Fisher  distribu¬ 
tion,  when  the  concentration  paraketer  k  is  not  known,  It  Is  not 
generally  noted  that  the  usual  test  Is  strictly  for  the  axis  along 
which  A  lies,  without  specifying  the  direction,  and  this  necessitates 
a  slight  revision  of  confidence  Levels, 
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In  section  3*  Significance  point s  are  provided  for  the  likelihood- 
ratio  test  for  A  for  a  Fisher  distribution,  for  the  case  when  the 
concentration  parameter  k  i£  known.  The  test  will  replace  a  test 
already  suggested  elsewhere,  and  indications  are  that  an  approximation 
will  hold  well  for  the  von  Mises  distribution  also. 

2 .  Teats  and  Confidence  Intervals  for  the  Modal  Vector  of  a 
von  Misea  or  Fisher  Distribution. 

2.1.  The  von  Mises  and  Fisher  distributions  have  densities  propor¬ 
tional  to  exp( /ccosct) ,  where  a  is  the  angle  between  a  sample  vector 
OP.  and  the  modal  vector  A,  and  k  is  a  concentration  parameter.  For 
r  a  test  of  Hq:  A  is  along  a  given  A^  when  k  is  not  known,  Watson 
and  Williams  (19%)  proposed  a  conditional  test,  both  for  the  circle-  and 
the  sphere.  Suppose  a  sample  of  N  unit  vectors  gives  a  resultant  vector 
R,  length  R,  and  let  its  component  on  A  be  X>  the  test  depends 
on  the  fact  that  the  conditional  distribution  of  R,  given  X,  is 
independent  of  k .  Critical  values  RQ,  for  given  N,  X,  and  a,  have 
been  given  by  Stephens  ( 1962a, b)  in  the  form  of  charts  of  Rq  against 
X>  if  R  exceeds  Rq,  the  null  hypothesis  is  rejected. 

The  distribution  of  R,  given  X,  is  in  fact  the  same  for  X 
positive  or  negative,  and  if  a  given  A„  is  acceptable  when  the 
component  of  R  on  A^  is  positive,  the  vector  -A^  would  be  equally 
acceptable,  the  component  being  now  negative.  Thus  the  test  is  strictly 
a  test  for  the  modal  axis,  without  direction.  Thi6  will  become  important 

whan  we  use  critical  values  R  of  R  to  obtain  a  confidence  interval 

o 

for  A.  The  use  of  the  charts  both  for  testing  and  for  confidence 
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intervals,  can  best  be  illustrated  with  the  help  of  a  diagram  In 
Figure  1  we  have  taken  the  case  of  a  sample  of  size  N  -  20,  drawn 
from  the  von  Mines  distribution, 

Suppose  OX  points  in  the  direction  of  the  modal  vector  A,  and 
let  OY  be  the  axis  at  right  angLes..  For  every  sample  with  resultant 
R,  we  can  calculate  Y  ~  £  and  set  C  s  (X,Y),  OC  is  the 

resultant  R,  and  C  represents  the  sample  on  the  diagram.  C  lies 
inside  the  circle,  center  0,  radius  20.  Let  R  be  the  critical 
value  of  R  for  given  X,  at  the  5$  level,  given  by  +he  charts,  and 
draw  the  curve  T  =  R2 -X2 ■,  call  this  the  b4>  crlt icai  limit  .  This 
is  shown  in  the  illustration,  and  also  part  of  the  1$  critical  limit 
obtained  from  1$  critical  values  of  R.  The  curves  are  symmetrical 
about  OY  a c  well  as  OX.  Since  for  every  X  the  probability  of  C 
falling  outside  the  curve  shown  In  0,05,  with  Y  either  positive  or 
negative,  the  probability  of  failing  above  the  upper  curve  is  0  025 
and  of  falling  be 'low  the  lower  curve  is  also  0.02  b,  whatever  the  value 
of  /c.  Suppose  now  OX  has  coordinate  measured  anticlockwise 

from  a  suitable  initial  line  OB,  if  the  true  modal  vector  were 
along  OX'  with  angle  0.^  >  dQ>  more  than  2 . 5$  of  sample  points  C 
would  fall  above  the  upper  2 .5$  curve .  In  the  same  way,  if  0^  <  Qq 
an  excess  of  sample  points  will  Lie  below  the  lower  2,5$  critical 
boundary-  As  a  result,  a  test  of  the  hypothesis,  Hq,  that  the 
direction  of  the  modal  vector  is  q  -  0  ,  against  the  one-sided 
alternative  that  0c  >  0  ,  consists  in  (a)  Calculating  X  and  R  from 
the  sample,  (b)  finding  from  the  charts  in  Stephens  (,1962a)  the  critical 
value  of  R(  .05)  |X  (or  R( , 01 1 X ) )  and  (c)  rejecting  at  the 


2.5$(or  0.5$)  level,  if  the  observed  R  >  Rq.  There  will  be  a  corres¬ 
ponding  one-sided  test  against  the  alternative  $c  <  0Q,  and  if  the 
alternative  is  simply  &c  6q)  then  the  steps  above  are  followed 
but,  in  ( c) ,  H0  is  rejected,  if  R  is  too  large,  at  the  5$ 

(or  1$)  level.  For  N,  X  not  given  in  the  chertB,  Stephens  (1962a) 
has  given  several  approximations.  Similar  charts  and  approximations 
for  use  with  the  Fisher  distribution,  are  given  in  Stephens  (1962b). 

2.2  Confidence  limits  for  9  . 

c 

Suppose  that  the  true  modal  vector  has  direction  ec  =  9Q,  but 

that  we  are  unaware  of  this  and  must  estimate  9  from  the  direction 

c 

of  the  sample  vector  resultant.  Figure  1  shows  two  possible  resultants 
OCi,  0C2,  at  angles  respectively,  both  of  the  same  length 

R  =  7.90.  We  find  from  Stephens'  tables  that  for  N  *  20  the 
corresponding  abscissa  for  a  =  0.05,  is  X  *  5.0,  and  note  that 

<p  «  cob"1(x|r)  =  cos-1  =  cos“1(5.0/7.9)  -  cos"1(.6?29)  =  50.7°. 

<p  is  clearly  the  angle  between  OZ  and  OX,  where  OZ  is  7. 9  and. 

Z  is  on  the  5$  critical  limit. 

Suppose  the  lower  confidence  limit  for  0  is  obtained  by  sub- 

c 

tracting  <p  from  the  angle  of  R*  the  two  illustrations  then  give 
limits  q1-50. 7°  and  02-50.7°.  In  the  first  example  this  limit 
gives  a  confidence  interval  which  does  not  include  the  true  modal 
vector,  along  OX,  and  this  is  clearly  because  the  point  is  above 

'the  5$  critical  boundary*  while  the  interval  baaed  on  OCg  does  include 
OX.  because  Cg  is  below  the  (5$  boundary.  Since  2.5$  of  samples 
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and  97-5$ 


would  give  a  sample  point  above  the  boundary,  .'Like  C^, 
below,  like  C^,  the  procedure  clearly  provides  h.  lower  97.^;$  confi¬ 
dence  limit  for  0^,  whatever  is  the  value  of  k  .  Similarly,  if 
we  add  cp  =  50.7°  to  the  angle  of  the  sample  resultant  vector  we  shall 
have  an  upper  97 '5$  confidence  limit  for  0^,  and  the  t_wo  limits  vi.U 
define  a  central  95$  confidence  interval  for  q  ,  A  similar  result 
holds  for  the  left  side  of  the  diagram,  where  X  is  negative  Thus 
the  interval  is  net  strictly  for  the  modal  vector  with  direc<  ion,  but 
only  for  the  axis  along  which  it  lies,  with  either  possible  direction. 

In  practice,  of  course,  except  for  k  very  small,  negative  values  of 
X  will  rarely  occur,  and  the  confidence  interval  is  always  chosen  to 
give  positive  X,  but  this  lowers  the  confidence  level.  The  amount 
by  which  it  is  lowered  is  found  as  follows.  Let  p  be  the  probability, 
for  given  K,  that  X  >  0,  i.e. ,  the  probability  that  the  sample  point 
falls  to  the  right  of  OY.  If  P  is  the  confidence  probability  (i.e., 
10QP$  is  the  confidence  level)  then  the  probability  that  the  above 
procedure  includes  the  positive  modal  vector  is  Pp .  The  relationship 
of  p  and  K,  for  p  near  1,  may  be  found  approximately  from  a 
table  of  percentage  points  of  X,  in  Stephens  (I9t>9a)..  From  this 
we  have 


N: 

.10 

10 

20 

20 

4o 

40 

P: 

0.95 

0-99 

0.95 

0.99 

0.95 

0.99 

K: 

0.74 

1.08 

0.5.' 

0-74 

0  .  V7 

0  . '  }  *> 

Thus  for  k  >  1,  the  risk  of  mistaking  the  direction  of  A  will  be 
very  small,  for  N  -  10  say,  and  the  confidence  .Lo^el  hardly  changes* 


for  larger  samples,  <  may  become  even  smaller  with  negligible  risk  of 
a  wrong  decision. 

2 .3  The  Fioher  Distribution. 

For  three-dimensions,  the  same  type  of  argument  holds,  though  a 
geometric  representation  would  now  involve  an  ellipsoid  for  the  5$ 
critical  limit.  Corresponding  to  two-sided  confidence  Intervals  for 
6  on  the  circle,  there  will  now  be  a  cone  of  confidence  for  9  on 
the  sphere*  it  would  be  difficult  to  Interpret  one-sided  testB  or 
confidence  intervals.  Again  the  test  based  on  R  for  given  X  is 
really  a  test  for  the  axis  of  A,  without  direction,  and  the  alleged 
confidence  probability  P  must  be  multiplied  by  a  p  which  depends  on 
N  and  < .  From  tables  of  the  distribution  of  X  for  given  N,  K 
given  in  Stephens  (1967)  the  table  of  values  for  p  for  the  sphere 
becomes 


N: 

10 

10 

?0 

20 

4o 

4o 

P: 

0.95 

0.99 

0.95 

0.99 

0.95 

0.99 

K : 

0.91 

1.30 

0.64 

0.91 

0.1*5 

0.64 

k.  is  required  to  be  a  little  higher  for  the  sphere  than  for  the  circle 
to  obtain  the  same  p  for  given  N. 

2,4  Numerical  illustration. 

In  Table  1  we  give,  for  N  =  10  and  20,  the  relationship  between 
E/N  and  the  critical  value  xj N  for  a  95^  confidence  interval  with 
the  Fisher  distribution*  also  the  ©c  to  which  this  corresponds. 
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Further,  F/N  is  used,  to  estimate  <  (hy  k,  the  solution  of 
cothK-l-K  «  R/N)>  the  estimate  could  then  be  used  to  estimate  p 
as  described  above.  Values  of  K  are  also  included  in  Table  1. 
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TABLE  1 


Critical  values  of  x/N  and  r>c  (to  nearest  degree)  for  95$  confidence 
interval,  given  R/N,  and  estimated  value  k  derived  from  R/n.  (Fisher 
distribution) . 


R/N: 

.44 

.45 

X/N: 

.00 

.056 

V 

90 

83 

X 

• 

1.51 

1.55 

N  =  10 


•  5 

.53 

.  6 

.241 

.342 

.431 

61 

52 

44 

1.80 

2.07 

2.4o 

.7  .8  .9 

,58?  .721  861 

34  26  17 

5.30  5-00  10.00 


N  =  20 


R/N: 

.32 

.35 

.4 

.5 

.6 

.7 

.8 

X/N: 

.00 

.16 

.29 

.42 

.53 

.65 

.77 

9c! 

90 

63 

44 

34 

28 

22 

17 

A 

K: 

1.03 

1.14 

1.34 

1.80 

2.40 

3.30 

5.00 

3 .  Likelihood-Ratio  Te3t  for  the  Modal  Vector  of  a  Fisher  Distribution 
When  the  Concentration  Parameter  is  Known. 

3.1  The  test  to  be  discussed,  in  this  section  is  similar  to  that 

in  Section  2,  but  with  the  concentration  parameter  k  assumed  known. 

This  is  probably  leBs  likely  to  occur,  but  when  it  does,  a  more  powerful 

test  of  H  can  be  mado.  The  likelihood-ratio  test  statistic  for 
o 

H  :  A  along  A  ,  against  the  alternative  H.:  that  A  In  along  a 
vector  other  than  Aq,  may  be  shown  to  be  R-X.  Thus  if  the  distribu¬ 
tion  of  Z  =  R-X  were  known,  a  test  based  on  Z,  rejecting  if  Z 
were  too  large,  should  give  a  more  powerful  test  against  the  alternative. 
The  exact  distribution  of  Z  is  difficult  to  find*  however,  for  the 
Fisher  distribution,  significance  points  can  be  found,  and  are  given  in 
Table  2.  The  test  may  formally  be  set  out  as  follows: 

(a)  Calculate  Z  =  R-X> 

(b)  In  Table  2  for  given  N,  K  and  a,  find  the  table  entry  z> 

(c)  If  Z  >  z,  reject  Hq  at  significance  level  a. 

If  K  is  too  large  for  Table  2,  solve  for  z  from 

2kz  =  >^(a)  ,  (1) 

where  ^(a)  is  the  upper  significance  point  of  at  level  a. 

3 .2  Theory  of  the  test. 

We  start  by  defining 
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and 


“(aHS rf  ■ 

sjj(t)  -  i  (")(-D8  ■  r-t-2s  >r  , 
s-0 

N  and  r  are  positive  integers,  and  |tj  <  N»  1  Re  notat  ion  <.  z  > 
means  <  z  >  -  z  if  z  >  0,  and  <  z  >  -  0  if  z  <  0.  The  joint 
density  of  R  and  X  Ls  (Stephens,  1.967) 

f-^(  R,X)  -  c(K)exp(^X)pJJ'?(R)/(N-2)  ’  ,  |X|  <  R,  0  <  R  <  (2) 

Let  y  -  R-X',  the  joint  density  of  y,R  is  then 

f?(y,R)  -  cU)ejcp(  -/<y)sxp(KR)i^'‘':,(R)/(N-2).?  } 

0<y<2R,  0  <  R  <■  N  (V) 

For  the  density  of  y  alone,  we  must  integrate  out  R,  so  we  require 

XN 

exp(*R)R{J'2(R)dR  .  ..with  V  -  y/P  - 
v 

This  Integral  J  enters  an  expression  for  an  integral  I  given  in 
Stephens  (1967,  top  of  page  2.1b)  and  may  easily  be  deduced  from  the 
result  for  I>  the  density  of  y  then  becomes 
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fj(y)  -  Kc(<)exp(  -Ky) 


N-i  rf|'r(y/2) 

exp(-K-y/2)  £  — - - 

r=2  k.  (N-r)! 


1_  2  (  exp(/<(N-2r) ) -exp(/<y/2 ) )  ( :,)( -1  V 

N  r-0 


,  0  <  y  <  2N 


Where  t  is  the  greatest  integer  less  than  (N-y/2)/2.  This  density  has 
been  integrated  numerically  to  give  the  significance  points  in  Table  2. 

The  approximation  in  step  (c)  of  Section  .5.2  may  be  deduced  from 

the  fact  that  -2  In  \  =  2k(R-X),  where  X  is  the  likelihood  ratio  for 

the  test  of  Hq,  and  so  is  asymptotically  distributed.  However, 

Watson  (1956)  first  gave  the  approximation,  and  deduced  it  from  the 

properties  of  the  Fisher  density  for  large  k  .  It  is  interesting  also  to 

see  that  it  follows  from  (2)  above,  if  k  is  large,  since  in  this 
circumstance  we  expect  large  R  and  X  and  therefore  small  values  of  y. 
Thus  if  we  replace  the  range  of  y  in  (2)  by  0  <  y  <  >*>,  and  adjust 
the  constant  c(k)  accordingly,  we  can  see  that  the  joint  density  of 
y  and  R  now  factors  into  two  components,  one  containing  only  y  and 
the  other  only  R>  with  appropriate  choice  of  constant  terms,  these 
must  be  the  marginal  densities  of  y  and  R.  For  y,  since  the  term 
with  y  is  exp(  -Ky) ,  we  see  that  <y  must  have  the  exponential 
distribution*  the  full  density  is 


f(y)  =  Kexp(-Ky)  ,  y  >  0 
and  this  is  equivalent  to 

2<y  -  •  (3) 
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These  two  approximate  derivations  of  (  J )  show  that  we  can  expect  It  to  be 
a  good  approximation:  (a)  when  K  is  iarge,  for  all  N,  and  (  b)  for 
smaller  values  of  k,  as  N  -*■*.  The  values  given  by  the  approximation 
are  recorded  in  Table  P  opposite  N  -  «.>  the  exact  values  given  for 
other  values  of  k  show  that  the  above  expectations  are  borne  out 
extremely  well. 

3 .3  Fowe  r  c  ompar i s  on  s ■ 

The  test  of  H  above  can  aiso  be  mde  using  X  alone,  since  its 
o 

distribution  for  given  N  and  k  is  known  and  significance  points  tabu¬ 
lated.  This  suggestion  was  given  in  Stephens  (1967),  but  was  wrongly 
described  as  a  two-tailed  test,  in  fact,  would  be  rejected  only  if 

X  is  too  small  for  given  N  and  k.  We  now  compare  the  two  tests 
for  power,  not  directly,  but  indirectly  by  comparing  the  confidence 
intervals  for  A  obtained  by  the  two  methods.  Suppose  A  is  at  angle 
0  to  R>  a  ( l-a)$  confidence  interval  for  9  is  0  <  S  <  Qc>  where 

9  is  the  solution  of  R  cos  Q  -  X  ,  and  X  is  the  critical  value 

c  c  c  c 

of  X  found  in  either  test,  at  significance  level  a- 

Consider  an  example,  when  N  =  10  and  k  =  4  The  5$  critical 

value  of  R-X  is  found  from  step  (c),  the  value  is  0.730,  so 

X  =  R-O.750.  For  the  test  based  on  X  alone,  X  =  6..  10  (from  Table  1, 
c  c 

Stephens  (1967))-  Thus  the  R-X  test  will  give  smaller  confidence 
band  for  9  whenever  R-0.750  >  6,10,  i  e,,  whenever  R  >  6.85  ~  RQ 
say.  From  Table  2  of  Stephens  (196/')  the  probability  Pr(R  >  R  )  can 
roughly  be  estimated^  for  N  10,  k  -  H,  ’he  lower  1$  and  5$  values 
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of  R  are  6.42  and  5.71  and  Pr(R  >  6.85)  is  near  0.9.  Thus  of 
the  time  the  R-X  teat,  for  N  =  10,  k  -  4,  would  give  greater  power 
than  the  test  based  on  X  alone,  for  N  =  10,  and  other  values  of  «, 
the  value  of  Rq  is  given,  and  a  rough  estimate  of  the  probability  p 
of  exceeding  It,  In  the  following  small  table: 


<: 

1 

15 

2 

2.5 

3 

4 

n  : 

0 

3 .21 

'5.78 

4.56 

5.29 

5-92 

0.85 

P: 

0.5 

0.7 

0.75 

0.85 

0.88 

0.90 

p  represents  the  probability  that  the  Z-test  Is  better  than  the  X-test. 
The  same  pattern  is  repeated  if  one  makes  the  calculations  for  N  =  20. 
Clearly  the  Z-test  Is  on  the  whole  a  better  test. 

3 .4  Corresponding  teat  for  the  circle. 

For  the  von  Mises  distribution  also,  Z  =  R-X  is  the  likelihood- 
ratio  test  statistic  for  H  ;  the  exact  distribution  cannot  be  handled, 
but  the  asymptotic  result  is  2k(R-X)  -  ,  so  that  critical  values  z 

of  Z  would  be  given  by 

2kz  -  ^(a) 

Monte  Carlo  results  suggest  that  this  approximation  will  hold  with 
in  a  manner  similar  to  that  for  the  sphere,  i.e.,  for  K  >2,  and  N  >  20, 
and,  for  larger  k,  even  for  N  <  20. 
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I  TABLE  2.  Critical  Values  of  R-X,  for  Test  of  Model  Vector,  Sphere. 


In  descending  order  the  values  are  for  ot  -  .01,  .025»  .05,  *10 
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W 


00 

9-210  ' 

4.605 

5.070 

2.502 

1.842 

1.555 

7-578 

>.689 

2.460 

1.844 

1.476 

I.250 

5.991 

2.996 

1.9S7 

1.498 

1.198 

0.999 

4.605 

2.505 

1.555 

1.152 

0.922 

0.768 
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